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ABSTRACT 



This thesis pertains to the area of systematic error 
recovery for the National Supply System. By techniques 
applied to real stock numbers a correction algorithim is 
successfully developed. The main research area for this 
tliesis is that of algebraic coding theory , especially 
Reed-Solomon codes and their application for the National 
Supply System using finite field theory and the RS code over 
GF( 11). 

The findings of the research are also discussed for a 
possible database interface. This product of the study may 
be used to assist supply officers and other officials devel- 
oping an error correction mechanism in order to have a more 
reliable and efficient supply system. 
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I. INTRODUCTION 



A. BACKGROUND 

After World War II, a very important issue in the war 
was realized which, if used effectively, would bring the 
victory: Logistics. 

One result, arising out of the second World War, has 
been the existence of two super powers in the world. Each of 
them has its own political, military and economic view- 
points. As a result of this, the NATO (North Atlantic Treaty 
Organization) has been established by the United States and 
some of the European countries who believe in the power and 
necessity of freedom. Since then, these members of NATO have 
worked together to provide required materials, military aids 
and education to each other, in order to develop a powerful 
and strong defense system against possible future threats. 

During this development, a supply system has been needed 
to provide countries with required items in some designated 
time. The Supply System of the United States has been 
approved as the Supply System of NATO with some minor 
changes. Nowadays, most of the NATO members use this system 
as their own National Supply System. 

However, from the establishment of NATO to the present, 
there has been no study about providing an error detection 
and correction mechanism in order to have a more reliable 
and efficient system. 

The fundamental problem of communication is that of 
reproducing at one point, either exactly or approximately, a 
message, selected at another point. The significant aspect is 
that the actual message is a selected message from a set of 
possible messages. The system must be designed to operate 
for each possible selection, not just the one which will 
actually be chosen [Ref. 1). A National Stock Number (NSN), 
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can be considered as a message for this purposes. This 
thesis develops an error correction coding system for MSN' s 
which allows correction of a broad class of errors. 

B. GENERAL VIEW OF A SUPPLY SYSTEM 

The existing supply system is a kind of bridge between 
resources and requirements by providing material at the 
required places at the required times. Therefore, the speed 
and accuracy of the supply are very important. In order to 
identify a specific item in the stock, it is necessary to 
provide an identifier and material classification. 

1. Material Classification 

There are over four million items in the United 
States Department of Defense Supply System. The Navy Supply 
System alone stocks over one million items. If we consider 
all NATO member's stocks, the size of the system would be 
staggering. For proper requisitioning of an item, a common 
language has been developed: the Federal Catalog System 
( FCS ) . The most important component of this system is the 
NSN. 

2. National Stock Number f NSIH 

A NSN is a 13 digit number assigned by the Defense 
Logistic Services Center (DLSC) to identify items of 
material in the Supply Distribution System of the United 
States. It consists of a four digit Federal Supply 
Classification Code (FSC), a two digit National Codification 
Bureau Code (BC), and a seven digit National Item 
Identification Number (NIIN). The NIIN part of a NSN is the 
most significant part and is used to uniquely identify each 
NSN item in the Federal Supply Distribution System [Ref. 2] 
Construction of a NSN can be seen in Figure 1. 1. 
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Figure 1. 1 Construction of a MSN 
3. Nato Stock Number 

As mentioned earlier in this chapter, the main logic 
for the Nato Supply System is almost the same as in the 
National Supply System of the United States. The only 
difference is the names of the three parts existing in NSN. 
The corresponding names are explained in Tacble I. 

The main difference is a two digit Country Code 
(also called Source Code) assigned for each NATO member 
instead of National Codification Bureau Code in the National 
Supply System. As an example, 00 and 01 represent the United 
States in this classification. 

C. PURPOSE OF STUDY 

The subject area of this thesis, the theory of error 
correcting codes, started as a subject in Electrical 
Engineering with Shannon's classic papers in 1948 and 1949. 
It has since become a mathematical topic and a part of the 
fascination has been the use of many varied tools to solve 
practical problems in coding. The possibility of applying 
finite field theory to problems in discrete communication 
was recognized in the late 1950' s. One such class of codes 
which is very famous and popular in this area is the 
Reed-Solomon ( RS ) codes. These codes are chosen to built an 
error recovery mechanism in this thesis because of their 
effectiveness and ease of use. 
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The goal of this thesis is to provide an error recovery 
method for the existing Supply System using recent tools in 
the Computer Science area, such as data base applications, 
and combining them with the Reed-Solomon codes application. 

In an effort to assist the' reader in simplicity and 
comprehension of this abstract subject, this author has 
taken the pertinent information vital to the thesis and 
created a chapter for each. After the general view of the 
supply system is introduced in this chapter, the necessary 
fundamentals of finite fields and algebraic coding theory 
are presented in Chapters II and III. In Chapter IV the 
presentation of Reed-Soloraon codes is made and the implemen- 
tation theory is discussed in Chapter V. Implementation of 
the RS codes to the existing system is presented in Chapter 
VI. Finally, a possible interface between users and an 
existing database is described in Chapter VII using the 
implementation of Reed-Solomon codes discussed in Chapter 
VI. Conclusions of the thesis are represented in Chapter 
VIII. 



TABLE I 

CORRESPONDING NAMES IN NATO SUPPLY SYSTEM 



Name Field in NSN 


Corresponding in Nato Supply System 


Federal Supply 
Classification (FSC) 


Nato Supply Classification 
(NSC) 


National Codifica. 
BUREAU Code (BC) 


Nato Source (Country) Code 
(SC) 


National Item Ident. 
Number (MI IN) 


Nato Item Identification 
Number (NIIN) 
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II. ALGEBRAIC CODING THEORY 



A. CODING WITH ALGEBRAS:. LINEAR CODES 

1 . Vector Spaces 

A vector space V^(K) of dimension n is a set of n 
long vectors V = (v^, v^, v^, . . . , v^) of elements in some 

field K and V forms an additive commutative group that also 
admits multiplication by scalars or elements from the field 
K. The rules for this- scalar multiplication are; 

Ifa, b^K,Vj^(-V then 

• a*V f V ; l*v = V ; a*(b*v) = (a*b)*v 

• a*(V]^ ^ ~ a*V2 + ^*'^2 

• (a + b)*v = a*v + b*v 

Addition of two vectors v^ and V2 is by component- 
wise addition by the addition defined in the field K. As an 
example, if Vj^(K) is the set of binary n-tuples, then K is 
the scalars, the elements 0 and 1 of the field of 2 
elements. 

2 . Related Definitions 

Linear Dependence : If V is a vector space and K is 

the scalar field, then a set of e vectors, V2/V2,. • •• 

are said to be linearly dependent over K if there exists a 
set of scalars, C2/C2,. • • /Cg, not all of them zero, such 

that; 



"=1 * "^1 ^ "=2 * ^2 



• + Cg * Vg = 0 



Linear Iiidependence : If the set of e vectors is not 

linearly dependent, i. e. , there do not exist scalars 
C2,C2,. . -/Cq such that 



. a. a. 

1 = 1 

then the vectors are said to be linearly independent. 
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Dimension of V : Let in be the largest number of 

linearly independent vectors of V. The dimension of V is m. 

B. GEMERATOR MATRIX 

Let A be an (n,k) group code. Choose k linearly indepen- 
dent vectors of A. Write these out as rows. The k rows form 
a (k*n) matrix G called the generator matrix for A. Every 
code vector is some linear combination of the rows of G. 

This matrix description serves as a compact list of all 
code vectors. For example a (13,11) code can be described by 
a 11*13 t-ary matrix while the code contains t^^ vectors 
where t is the size of the underlying field K. 

1. Alternate Descriptions. Parity Check Matrix 

If a = (aj_) and b = (b^^) then , we introduce the dot 
product (scalar product) of the two n-long vectors a and b, 
( a. b ) as 

n 

( a. b) = ^ ^i- ^i (2.1) 

i = l 

It should be remembered that, sums and products are 
performed in K, which was introduced as the underlying 
field. If a. b = 0 then a is said to be orthogonal to b. 

Mow, let us consider a matrix H, which is a ( n-k * 
n) matrix, whose rows are linearly independent. 

Let be an orthogonal space to H, that is, a ( 

--> a. u^ = 0, j = l,2,. . . ,n-k where u^ is a row in V. In 

matrix notation aH = 0 ( a is a l*n vector here; T signifies 
transpose). H is called the parity check matrix to V^. We 
first note that if a is orthogonal to the vectors ( rows) of 
H, it is orthogonal to the vector space spanned by these 
vectors (by forming linear combinations of these vectors). 
We will illustrate by the following example how a parity 
check matrix can be obtained: 
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Example 2. J.: 



Let H be defined as. 



I 1 1 1 0 0 1 0 I 
H=|1100101| 

I 1 0 0 1 0 1 0 I 

In this example H is a (3*7) matrix. Then is a 
(7,4) group code containing the all one vector. We can also 
say that, the (13,11) code discussed in this section 
requires only a (2*13) parity check matrix for its descrip- 
tion. An example of an important code obtainable using 
vector space or matrix methods is the Hamming code. 

C. CODING WITH FINITE FIELDS: BINARY CYCLIC CODES 

The most important group codes are the cyclic ones. 
These codes are distinguished by their ease of encoding and 
by the highly algebraic mechanizable approach to their 
decoding.'; 

The first discovery in this area was made by 
Bose-Chauduri and Hocquenghem independently. But the 
approaches, introduced by these two, has minimal error 
correcting capability [Ref. 3] 

Nowadays, there are many approaches to the exposition 
and treatment of these codes. However, certain mathematical 
knowledge and tools are required. For that reason, we intro- 
duce some more details before examining a most useful set of 
codes in this area , Reed-Solomon codes, in Chapter IV. 

1. Finite Difference Equations 

One of the most common approaches to cyclic codes is 
via finite difference equations. This approach represents 
one particular way of encoding that results in systematic 
cyclic codes. A major advantage of this approach is that, 
the encoding of cyclic codes becomes very natural and the 
decoding procedures emerge as a direct consequence. 

Let us consider a finite difference equation of 
degree k, with constant coefficients in the field F: 
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+ . 



0 



^n+k '^l^n+k-1 ^2^n+k-2 

where f F , n = 0, 1,2,3,. 



^k-l^n+1 






We wish to solve the above difference equation, 
given k initial values; aQ, a-^, a 2 , ■ ■ ■ /^k-i ( A general 

approach to the solution is illustrated in the follov/ing 
example. 



Example 3.Z: In this example we show that, the 
codeword ( or output word) can be obtained by using the given 
finite difference equation and the given initial values. We 
say, + a^^ = 0 is the finite difference equation 
and aQ = 1, a^^ = 0, a 2 = 1 are the initial values. The 
required operation could be performed over, F, the field of 2 
elements, in following way; 



^3 ~ ^0 ^1 ~ 



a^ - ai + a2 



^5 ~ ^2 ^3 



ag - as + a^ 



a-7 — B.A Sc — 



^8 “ ^5 ^6 “ 



^9 “ ^6 ^7 



a^Q— a-7 + a© — 
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1 + 0 
0+1 
1+1 
1+1 
1 + 0 
0 + 0 
0+1 
1 + 0 



1 

1 

0 

0 

1 

0 

1 

1 



- - > a- 



--> a. 



--> ac 



--> a. 



- - > a 



0 



--> a- 



-- > a- 



--> a- 



and as it turns out, the sequences a^^ obtained by this 
method are ultimately periodic. 

Then the codeword obtained by performing this opera- 
tion is 1011100 repeated periodically. We will use shift 
registers to provide the encoding process for this example. 
For the above problem, it is clear that, a^^_^ = aj ^_2 + ^k-3 
and we also know the initial values. All we need to do is. 
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til 

construct a cycle in v/hich the k element of the code will 
calculated just by adding previous register values. We use 
the given formula, when building the shift register diagram 
as represented in Figure 2. 1. 






+ 


— > 


^k-1 




^k>2 


-- 


• > 


^k-3 






( 1 )‘ 




( 2 ) 






( 3 ) 



->output 



STEP #: 



Initial 

1 

2 

3 

4 

5 

6 

7 

8 



POSITION #: 
#1 #2 #3 



OUTPUT 



1 

1 

1 

0 

0 

1 

0 

1 

1 



0 

1 

1 

1 

0 

0 

1 

0 

1 



1 

0 

1 

1 

1 

0 

0 

1 

0 



1 

0 

1 

1 

1 

0 

0 

1 



> ar 



> a- 



^ 3. ^ 



- — — > a- 



> a. 



— — — > ac 



> a. 



> ar 



CODEWORD (OUTPUT) : 1011100. 



Figure 2.1 Shift Register Application 
D. (M,K) CYCLIC CODE 

We have, via the recursion polynomial f(x), a natural 
mapping of k-tuples into n-tuples, given by the recursion 
rule (or diffex'ence equation); 
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aQ, a^,. . . > 3q, . . ■ , • • --^n-1 

where the are given by the difference equation. This 

mapping is linear, i. e. , preserves the group structvire. The 

number of distinct initial a's is 2^ and the linear property 
states, if a arises from ( Hq, a^, . . . , and b from 

(bQ, b^,. . . , then a+b arises from ( aQ+bQ, a^^ + b-j^, . . 

• ' ^k-1'’’^k-l ) • Thus we have generated an (n,k) group code. 

This particular group code is cyclic in the sense that, if a 
= ( Sq, a-j^,. . . is a codeword, any word obtained by 

cyclically shifting .a by a position to the right or left 
along this sequence is also a codeword. That is, (a-j_, a 2 , . . 

• / ^n- 1 ' ^0 (^ 4 ' ^ 5 ^ • •' ^0 ' ^1' ^2' ^3^' etc. , are also 

codewords. 

The conclusion is straightforward. If we choose any 
polynomial f(x) of degree k with coefficients in GF(2) and 
with no repeated roots, which divides but not x™'*’^, m 

< n, then by forming the associated difference equation, we 
have a means of generating an (n,k) group code. 

o 

Example Let f(x) = X + X + 1. f(x) is a factor of 

7 

X . + 1. The associated difference equation will give rise to 

a (7,3) cyclic code, where for example 111 > 1110010. 

Any other nonzero condition yields a codeword which is a 
cyclic shift of the given codeword. 

Figure 2.2 shows how we produce this expansion. All we 
need to do is, sum up the last two digits, write down this 
sum as a new digit (to the left side), and shift all digits 
one position to the right. The codeword is obtained from 
first digit to seventh digit , i. e. , from left to right. 

Cyclic codes are useful for the ease of the encoding 
processes as they are easily mechanized by shift register 
devices. The cyclic property clearly minimizes storage 
facilities. Further these codes are easily analyzable, and 
also have very efficient decoding properties. The decoding 
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Figure 2. 2 Creation of a Codeword 

also proceeds via a shift register algorithm. We omit the 
details of the general shift register decoding here and 
illustrate the decoding in a specific instance hater. 
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III. FINITE FIELD THEORY 



A. BACKGROUND 

Finite or Galois fields (named after the nineteenth 
century French mathematician Evariste Galois) play many 
important roles in signal processing and information theory 
applications. However, in this thesis we are concerned only 
with their use in the construction of Reed-Solomon error 
correcting codes. We- begin with the general definitions in 
order to understand the pertinent facts regarding finite 
fields. 

B. CONSTRUCTION 

1. Definitions 

A field is a set of elements, including 0 and 1, any 
pair of which may be added or multiplied, (denoted by + or 
*, respectively), to give a unique result in the field. 

The basic building blocks are the prime fields Fp 
, where p is a prime number. Fp is the field whose elements 
are 0,1,. . . ,p-l, and arithmetic is performed modulo p. The 
additive structure is that of the vector space defined in 
the previous chapter. 

The addition and multiplication are associative and 
commutative, and the multiplication distributes over addi- 
tion in the usual way: u*( w+v)=u*w+u*v. Every field element 
u has a unique negative -u such that u+(-u)=0. Every nonzero 
field element u has a unique reciprocal field element 1/u, 
such that u*( l/u)=l. For every field element u, 
0+u=u=l*u, and 0*u=0. Thus the numbers 0 and 1 are the addi- 
tive and multiplicative identities, respectively. 

The order of the field is the number of elements in 
the field. If the order is infinite, the field is called as 
an infinite field and if the number of elements is finite, 
v/e call the field a finite field. 
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C. MULTIPLICATIVE STRUCTURE 



Let Eg be a finite field with q = elements. The 

nonzero elements of Fg form a commutative group, (Fg)", of 
order q-1, which is in fact a cyclic group under multiplica- 
tion. Finite fields can be constructed as polynomial alge- 
bras by defining multiplication as polynomial 
multiplication. If we start with two polynomials f(x) and 
g(x) of degree less than n, then their product f(x)*g(x) 
when formed in the usual way is not necessarily a polynomial 
of degree less than n. In order to satisfy closure, we write 
[f( x)*g( X)] modulo P(x) where P(x) is an irreducible polyno- 
mial of degree n over the field. An element of multiplica- 
tive order q-1, that is, a generator of the group ( Fg) , is 
called a primitive root. 

k 

It thus follows that every element a in (Fg) satisfies 
a*^”^ = 1, and so every element in F^ satisfies a'^ = a. If F 
is viewed as a subfield of Fgm for some m, then the equation 
characterizes Fg, that is, a*^ = a iff a is an element of Fg. 
In other words, the multiplicative group of nonzero field 
elements is cyclic, i. e. , it is a group that consists of all 
the powers of one of its elements, a. Multiplication can 
alternatively be defined as a^*a^ = a^’’’^ where i + j is 
computed modulo (p*^-!) and a is the generator of this group. 



D. THE MINIMAL POLYNOMIAL 

The minimal polynomial of a is defined to be the monic 
polynomial f(x) of least degree with coefficients in Fp such 
that f( a) = 0. Over F f(x) is irreducible, but in the 
larger field F^ f(x) factors into linear factors: 



f( x)=( x-a)( x-aP) . 



. ( x-aP 



k-1 

) 



(3. 1) 



where k is called the degree of a and thus the degree of 
f(x) is the same as the degree of a. 
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If a is a primitive root in F m the minimal polynomial 

of a is called a primitive polynomial over F . It is often 

P 

convenient to reverse this process and use a pt'imitive poly- 
nomial to construct a field. 

1. An Example Of The Creation Of a Field 

Consider the Galois field GF(2^). It has 2 ^ elements 
and may be constructed as the field of polynomials over 
GF( 2 ) modulo the irreducible polynomial 1= x + x^. If we let 
b represent a root of this irreducible polynomial, then it 
is also a primitive element of the field. Field addition of 
the elements is bit-by-bit modulo 2 addition while multipli- 
cation of the elements is described using the primitivity of 
the element b. Thus, b^*b^=b^'*’^ where i + j is reduced modulo 
15, if necessary. 

After defining the addition and multiplication over 
GF(2^), we are ready to create the field. First, we need to 
have a primitive polynomial which will be order of 4. We 
will select the primitive polynomial as f(x)= x'^+x+l. The 
operations which are required to create .the field elements 
can be performed as : 

f(b)= 0 ==> b^ = b + 1 

b^ = b^ + b 

b^ = b^ + b^ 

b”^ = b^ + b + 1 



b 

b 

b 

b 



14 

15 

16 
17 




1 
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b^^ = b^^ + b^^ = b + 1 = b"^ 



The field elements are listed in Table II. 
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TABLE II 



REPRESENTATION OF GF(2'^) 



Field element 


b Polynomial 


4-Tuple 


0 




0000 


b° 


1 


0001 


bl . 


b 


0010 


b^ 


b^ 


0100 


b3 


b^ 


1000 


b^ 


b +1 


0011 


b5 


i+ b 


0110 


b^ 


1^ + b^ 


1100 


b^ 


1^+ b +1 


1011 


b8 


b^ +1 


0101 


b^ 


# + b 


1010 


bio 


lD+ b +1 


0111 


bll 


3 2 
l3 + b+ b 


1110 


bl2 


If? + ]d + b +1 


1111 


bl3 


b^ + b^ +1 


1101 


bl4 


b^ +1 


1001 
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IV. REED-SOLOMOM CODES 



A. BACKGROUND 

Cyclic codes over an alphabet of symbols were origi- 
nally introduced by Gorenstein and Zierler (1961) along with 
an effective error correcting procedure. They also pointed 
out that the Reed-Solomon burst error correcting code may be 
considered as a code in this class [ Ref. 4] 

Reed-Solomon code-s are (q^-l,m) cyclic codes over GF(q^^) 
and were originally defined by Reed-Solomon differently than 
the Gorenstein and Zierler formulations. The code RS(n,t) 
is called a k error correcting Reed-Solomon code of length 
n. In this definition t is the number of information symbols 
in the codeword and k < n-t. These codes can correct both 
random and burst errors over a communication channel and 
hence are ideal for the numerous real time and reliable 
communications demanded by these applications. The 
complexity of RS encoders and decoders are proportional to 
the error correcting capability of the code, the speed of 
the decoding and the interleaving depth used. 

The code RS(n,t) consists of all vectors C = (Cq,C-l,. . 

. , Vj^(Fq^) such that the corresponding polynomial 

C(x) = ^ has the form C( x ) = ( x - a) ( x - a^ ) . . .(x - 

2 1 

a )I(x), where I(x) is a polynomial of degree < n-l-2t over 
Fq”'. I(x) is the polynomial of the information symbols 
while (x - a) (x - a^). . .(x - a^^) is the polynomial of 
the check symbols. The code has parameters: 



Length : n 

Dimension : k 

Min. distance: d 



q'^-l 
n - 2 1 
2t + 1 
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B. GENERAL ENCODING PROCESS 



As mentioned earlier, Reed-Solomon codes are cyclic 
codes over GF(q^'^). Let aQ, a-j_,. . -/^m-l elements of 

GF( q^) , then the code is defined in a non- systematic way as 
follows: 



a(x) = aQ + a^x + . . . + (4.1) 

We let c be a (q^-1)^^ root of unity. We define b = (bg, 
b]_,. . .,bgn_ 2 ) to be the vector whose coordinates are given 

by bj^ = a(c^) , i = 0, 1,. . ., q^“^. The code map ( aQ, a 2 , . 

• -'^m-l^ ^ (^0' ^1'* • gives rise to a (q'^-l,m) 

code with maximum distance d>q^-l-(m-l)= q^ - m which 
will correct errors up to 

e < (q^ - m)/ 2 (4.2) 



Here the length of the codeword is q^ - 1 and the field 
in which the symbols lie is GF(q^). For any vector a in 
Vgn_ 2 (q^), the codeword associated with a(x) ( ga(x) is 
written very simply; 

ga(x)=CQ + c^x +. . . + ( c^n. )x^^"2 (4.3) 



The recursion rule or polynomial associated with the code 
is; 




The Reed-Sol'omon code may thus be encoded systemati- 
cally. 

1. General Encoding Algorithm 

As discussed in Chapter III, an (n,k) code can be 
generated by a polynomial of degree n-k. If the polynomial 
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is primitive of degree r and n = 2^-1, the code can be 

encoded and decoded with primitive shift registers [Ref. 5]. 
Hence, we restrict our attention solely to the case of prim- 
itive polynomials. We illustrate the required algorithm for 
encoding of RS codes using primitive polynomials as follows: 

1. Represent the message as a polynomial. Call this poly- 
nomial m(x). Degree of m(x) is at most k. 

2. Multiply m(x) by x^"^’ to shift the message digits to 
the far right. 

3. Calculate the remainder when x^”^^ ni(x) is divided by 
p(x). (Note that p(x) is the primitive polynomial). 

4. Form the code polynomial as the sum x^”^ x) + r(x), 
(r(x) is the remainder of the division). The check 
polynomial is actually then a multiple of p(x). 



C. GENERAL DECODING PROCESS 



The problem for decoding is to find the error positions 
and symbol changes. There are two decoding procedures avail- 
able to do this. The first of them requires finding the 
coefficient matrix which will determine the symmetric func- 
tions of the error positions, @, or finding the appropriate 
augmented matrix and computing the syndromes, ( S j ) , for j > 
dQ. The second method will not be addressed in this thesis. 

1. General Decodincf Algorithm 

The decoding of an RS code is composed of the 
following six steps. For this algorithm, the input is the 
received vector R and the output is the codeword C. 

1. Compute the syndromes, using the equations: 




j = 0, 1,2. . . 2t-l. ( 4. 5) 



2. Perform Euclid’s algorithm on x^^ and S( x) = S^ + S 2 X 

? t* — 1 

+ . . . + S 2 .^x . Stop as soon as the degree of the 

remainder rj < t. Use the same algorithm to determine 
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3. 

4. 

5. 



the coefficients of the error locator polynomial @(x). 
Then calculate w(x), the error evaluator polynomial, 
using the fact, @(x).S(x) = w(x) (mod g(x)), where 

g(x) is an irreducible polynomial over the field. 

Find B = { b ( Fgm : @(b) = 0}, by trial and error 

method. 

For each b (• B , set Ej^ = w(b)/@'(b) where (§>'(b) is 
the formal derivative of @(b). 

For each i = 0,1,. . . ,n-l , set 



Output the codeword by subtracting the error vector 
from the received vector 



0 if a’^ 








if a“^ (• B. 



C 



^ 0~^0 ' ^ 1”^1 ' ■ 
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V. IMFLEMEMTATIOM THEORY 



A. BACKGROUND 

In this chapter we look at the theoretical concepts 
behind the creation of a particular finite field and 
construction of a specific RS code. As mentioned in Chapter 
1 , a NSN (National Stock Number) consists of three basic 
units and one of them is NIIN (National Item Identification 
Number). It has also mentioned that the NIIN part of a 
stock number is unique for each supply item. Thus, we will 
consider only the NIIN part of the stock number when we 
construct our coding scheme in the subsequent chapter. 

There are seven digits in the NIIN part of the stock 
number. These symbols will be the information symbols in the 
codeword. We also, in this chapter, discuss the number and 
the types of the errors we consider for the correction 
process. That is, we decide how many error check digits we 
must have in the codeword. We also introduce the order and 
construction of the finite field we use for our particular 
application. Since the symbols are not binary, the 
Reed-Solomon codeword we will design contains symbols which 
lie in a larger field than GF(2). In particular, since the 
symbols are digits, the field will have to have at least 10 
elements. As the size of field must be a power of a prime 
number, we shall use a field of 11 elements. 

We also, in this chapter, discuss and decide upon the 
generating polynomial g(x) and find a primitive root of 
unity (generator), a, for our construction of the 
Reed-Solomon code. 

B. ERROR CHECK DIGITS 

Before going further with the discussion and construc- 
tion of the finite field we shall need, it is necessary to 
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find out the number of error check digits which will be used 
in the codeword. Our coding scheme will be an interface 
between users and computers. In order to figure out the 
number of check symbols, we consider the common type of 
errors which are most likely to be made by humans. V7hen 
dealing with humans, three types of errors are common 
[Ref. 6] 

1. People have a tendency to interchange adjacent digits 
of numbers; for example 67 becomes 76. 

2. Another common error is to double the wrong one of a 
triple of digits, two adjacent ones of which are the 
same; for example 667 becomes 677 merely by a change 
of one digit. 

3. A third kind of simple error is just the substitution 
of one symbol for another. 

These are the most common errors in arithmetic. We will 
provide for the correction of two errors in our application. 
This will handle the first one of these common errors as 
well as the other errors mentioned. So, the maximum 
distance required as discussed previously can be found by 
applying the equation; 

dQ > q^ - m = q^ - 7 ‘ (5.1) 

where m represents the number of information symbols in the 
codeword. In the NIIN portion of the message, there are 7 
digits present. 

Since we want to correct up to two errors, according to 
the equation 4. 2 which we introduced in Chapter IV; 

e < q^/2 - m/2 > 2 ==> q^ >11 (5.2) 

Accordingly we have found from the above equation, we 
will choose GF(ll) for our application. Here, q = 11 and n = 
1, and the computations will be in arithmetic modulo 11. To 
make this approach clear, we illustrate addition and multi- 
plication over GF(ll) in Table III and Table IV respec- 
tively. The symbol A is used to represent the digit 10. 
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Our findings from equation 4. 2 show that, we will have 
four error check digits in the codeword. One of these ( the 
last one on the right) is the parity check digit. Parity 
checking for this type of coding scheme provides the total 
error amount occuring" in the codeword computed modulo 11. 
It is obvious that, if this error amount is zero, there are 
several possible cases to be considered. Either no error 
occured, or there are some interchanged bits in the code- 
word, or the total error amount is a multiple of 11. 
Therefore we provide a decoding process by calculating all 
the syndromes first. Then the error correction procedure is 
as described in the decoding procedure of Chapter IV and 
more precisely in Chapter VI. 

C. GENERATING POLYNOMIAL 

As mentioned in the previous chapter, the generating 
polynomial for the RS code is described by the equation: 

g(x) = (X - b)(x - b^). . .(X - b^^"!) (5.3) 

where t is the error correcting capability of the RS code 
and b is a primitive root of unity (generator). The signifi- 
cant point here is to find the generator element first in 
order to determine the generating polynomial according to 
equation 5.3. Since the RS code is cyclic; the powers of a 
primitive root should generate all the roots of unity. In 
other words, the powers of the generator should generate all 
the nonzero elements of the finite field, namely GF(ll). 

It can be determined that 2 is a generator, by trial and 
error method. For GF(ll) the verification that 2 is a 
generator is shown in Figure 5. 1. Since we have the gener- 
ator 2 and we have already decided the error correcting 
capability of the code, (2 error correction) we are now able 
to calculate the generating polynomial. 
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TABLE III 

ADDITION OVER GF(ll) 



+ 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


0 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


1 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


0 


2 


2 


3 - 


4 


5 


6 


7 


8 


9 


A 


0 


1 


3 


3 


4 


5 


6 


7 


8 


9 


A 


0 


1 


2 


4 


4 


5 


6 


7 


8 


9 


A 


0 


1 


2 


3 


5 


5 


6 


7 


8 


9 


A 


0 


1 


2 


3 


4 


6 


6 


7 


8 


9 


A 


0 


1 


2 


3 


4 


5 


7 


7 


8 


9 


A 


0 


1 


2 


3 


4 


5 


6 


8' 


8 


9 


A 


0 


1 


2 


3 


4 


5 


6 


7 


9 


9 


A 


0 


1 


2 


3 


4 


5 


6 


7 


8 


A 


A 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 
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TABLE IV 

MULTIPLICATION OVER GF(ll) 



•k 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


1 


2 


3- 


4 


5 


6 


7 


8 


9 


A 


2 - 


0 


2 


4 


6 


8 


A 


1 


3 


5 


7 


9 


3 


0 


3 


6 


9 


1 


4 


7 


A 


2 


5 


8 


4 


0 


4 


8 


1 


5 


9 


2 


6 


A 


3 


7 


5 


0 


5 


A 


4 


9 


3 


8 


2 


7 


1 


6 


6 


0 


6 


1 


7 


2 


8 


3 


9 


4 


A 


5 


7 


0 


7 


3 


A 


6 


2 


9 


5 


1 


8 


4 


8 


0 


8 


5 


2 


A 


7 


4 


1 


9 


6 


3 


9 


0 


9 


7 


5 


3 


1 


A 


8 


6 


4 


2 


A 


0 


A 


9 


8 


7 


6 


5 


4 


3 


2 


1 
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The generating polynomial can be obtained with reference to 
the equation 5. 3 and Figure 5. 1 in the following way: 

g(x) = (X - b^)(x - h^){K - b^) 

= (X - 2)(x - 4)(x - 8) 

= (x^ - 6x + 8)(x - 8) 

So., the generating polynomial is described by the equa- 
tion; 

g(x) = x^ - 3x^ + X + 2 (5.4) 

where sums and products are computed modulo 11. 




Figure 5. 1 Verification of the generator b = 2 
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D. DECODING TOOLS 
1. Syndromes 



In chapter IV, we introduced the general syndrome 
calculation for the Reed-Solomon code by the equation; 



S^ = X R.a"^^ , j = 0,1,2. ..2t-l 

J ’.-A 



( 5 . 5 ) 



where a represents the primitive root of unity ( generator) 
and t represents the error correcting capability of the RS 
code. Referring to equation 5 . 5 , we will have four syndromes 
namely Sq, S^, S2, S3. The first syndrome, Sq, will show us 
the error amount occuring in the codeword. The last digit of 
the codeword, the one we appended for parity checking, will 
only be used in calculating the syndrome value Sq. We will 
not use the last digit for calculating the other three 
syndromes S]_/ S2 and S3. 

2 . Conditions of Syndromes in Case of Errors 

After calculating the syndromes, we will be able to 
examine the received codeword and make some decisions about 
it. From the syndromes Sq, S^, S2 and S3 it is possible to 
find and correct up to and including any two symbol errors 
according to the following descriptions. 



In case of no error, all the syndromes will be equal to 
zero. 



^0 “ ^1 “ ^2 “ ^3 “ ° 



( 5 . 6) 



In case of one error, the syndromes have the property of 



^0 “ ®k 



k _ 



St — 6}^a — ei,2 



So = eva^"^ = ei,2 



2k _ 



>2k 
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S — a — fa o3k 

^3 ®k^ ®k^ 

where ej^ is the error amount occuring in the k^^^ digit. So, 
the equality we will be looking for in case of one error is 

S;^/So = S2/S1 = S3/S2 = ( 5 . 7 ) 



Thus the location of the error is determined by S^^/Sq = 2 ^, 
the k^^ digit and the amount of the error is Sq = ej^. 

In case of two errors, the syndromes are given by 



So 


= ®k ®1 




Si 


+ 

0) 

II 


e^^a^ 


S2 




eia21 


S3 


= * 


eiaSl 


where the 


two errors are e^^ and ej_ occuring in the k^^ and 



1 ^^ digit of the codeword, respectively. The inequality we 
will consider in case of two errors is 



^1/^0 ^ ^2/^1 ^ ^3/^2 ( 5 -Q) 

That is, not all of S^/Sq, S2/S3 and S3/S2 are equal. In the 
next section we give the procedure to determine the loca- 
tions of the errors in case of two errors. 

E. DECIDING ON THE POSITIONS OF ERRORS 

Once the syndromes are calculated, we are able to decide 
on the positions of errors occuring (if any). But, it is 
required to have an equation to do that which will have only 
1 and k (the positions of errors) as unknowns. Such an 
equation can be obtained in the following way. 

The syndromes have been determined as 

Sq = e + d 
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e2l^ 


+ 


d2^ 


e22k 


+ 


d2^^ 


e23k 


+ 


d231 



where e and d represent the error amounts and k and 1 repre- 
sent the error positions, respectively. 

Now, we multiply the syndromes and forming 

S^S3= e^2^^+d22^^+ed( 

= e^ 2 '^^+d^ 2 '^^+ed 2 ^'^^( 2 ^^+ 2 ^^) 

Next we form the square of the syndrome S 2 ; 

(S2>^= e22^^+d^2^^+ 2ed2^'^^( 2^"^^) 

When we subtract ( S 2 ) from S 3 S 3 we obtain 

3^33 -( 32)^=ed2^'^^( -2( 2^2^ ) +2^ ^+2^^ ) 

And finally we get the equation 

3i33~32^=ed2^2^( 2^-2^)2 (5.9) 



In the following steps, we substitute e and d in equa- 
tion 5.9, so that we get an equation including only k and 1 
as unknowns. First, by referring to the first syndrome 3q, 
it can be derived that, e = 3 q - d. When we substitute this 
value of e in the second syndrome equation, we get 

(3q - d)2^ + d2^ - 33 = 0 

d( 2 ^ - 2 ^^) + 3q2^ - S 3 = 0 

3o, the value of d can be obtained as 
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( 5. 10 ) 



d = (S^^- 2^Sq) / ( 2 ^ - 2 ^) 

Now, we substitute the value of d obtained in equation 
5.10, into equation 5.9 yielding 

S1S3 - (S2)^ = e[(S3-SQ2^) / (2^-2^)] 2^^2^ ( 2 ^- 2 ^)^ 

= e(S3-SQ2^) 2^2^(2^-2^) 



Now using again the equation, e = Sq- d, the value of e 
can be calculated as;* 

e = {Sq2^ - S 3 ) / (2^ - 2^^) (5. 11) 



Now, we substitute equation 5.11 into equation 5.9; 

S3S3-(S2)2 = {Sq 2^ - S^){S^ - SQ2^)2h^ 

= (So2^ - S3) 2^ (S3 - Sq2^) 2^ 

Finally, we get the equation; 

SiS 3 -S^ 2 =( Sq2^^-S32^)( S32^-Sq2^^) 

As seen in the above equation, the only unknown terms are, 
the error positions k and 1. We use this equation to locate 
the error positions by a trial and error procedure in the 
subsequent chapter. 
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VI. IMPLEMENTATION 



A. BACKGROUND 

The fundamental and necessary concepts for constructing 
an RS(11,7) code are discussed in the previous chapter. Our 
findings from Chapter V are used in this chapter to imple- 
ment the RS(11,7) code. We use the generating polynomial, 

primitive root of unity, syndromes and the relationship 

\ 

between the positions- of the errors occuring as we found and 
discussed them in Chapter V. 

We also, in this chapter, provide the encoding and 
decoding algorithms in more detail and also illustrate some 
examples to show their application. 

B. ENCODING PROCESS 

As discussed in chapter IV, the RS codewords are formed 
as multiples of the primitive generating polynomial g(x). As 
g(x) is of degree r, there are n - r = k* information symbols 
which can be chosen freely. Then r check symbols are deter- 
mined so that the resulting codeword satisfies the criteria 
stated, namely that the codewords are multiples of the 
generator polynomial. In other words, the check digits are 
the coefficients of the remainder r(x) upon division of the 
information polynomial p(x) by g( x) as shown in example 6.1. 
Here we consider n as 10, as our codeword is of length 10. 
We also append the parity check digit after we calculate the 
first three check digits. So p(x) can be obtained as a poly- 
nomial of degree 9 and a parity check symbol appended. 

1. Encoding Algorithm 

1. Represent the Ml IN part of the stock number as 
coefficients of the . polynomial of degree 9. Call this 
polynomial p(x). Thus x^ * (the polynomial of degree 6 
representing the 7 digits of NIIN) is the 
representation. 
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2. Perform the required division p(x)/g(x) where g(x) is 
the generating polynomial having the value of - 3x^ 

+ X + 2 . 



3. Calculate the check digits as the coefficients of the 
remainder r(x) upon division of p(x)/g(x) and add 
these check digits to the right of the NIIN. 



4. 



Calculate the 

equation; 

9 




i = 0 



parity check 



(mod 11) 



digit/ 



by using the 



( 6 . 1 ) 



where p represents the parity check digit value and C 
vector of first 10 digits of the codeword. 



5. Append the parity check digit to the far right and 
output the 11 digit number as the encoded codeword C. 



Example Jl- J. : Now we will give an example of the encoding 

process by applying the encoding algorithm we developed. 
Suppose the NIIN part of the stock number is 0000001. If we 
represent it as the coefficients of the p(x), of degree 9, 
we obtain p(x) = x , and we satisfy step 1 of the algorithm. 
According to step 2, we perform the required division of 

O 

p(x)/g(x) and get the remainder r(x) = 3x - x - 2. So, this 

will give us the first three check digits as -3, 1 and 2. 

Now we add these digits to the right of NIIN and get C = 

0000001-312. Then applying the equation 6.^, we calculate 

the parity check digit p as -1, so that C- + p = 0. 

1=0 -‘- 

Appending this parity check digit to the far right, yields 
the encoded codeword 0000001-312-1. V7e use this encoded 
codeword in our decoding examples in this chapter and call 
it C. When C is received, with or without errors, it is 
called R. 
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Example 6.2 : Vie now provide another example to 
show how the required division is performed in a more 
complex case. Suppose the NIIN part of the stock number is 
9876543. The required division p(x)/g(x) can be seen in 
Figure 6. 1. Then, the check digits are obtained as A, 8, A 
(A represents number 10). Applying the encoding algorithm 
step 4, the parity check digit will be obtained as 7. So the 
output of the algorithm is the codeword R = 9876543A8A7. 

C. DECODING PROCESS 

The decoding process is, in general, much more compli- 
cated than the encoding process. Not only must we deal with 
the detection of errors but also with their corx'ection. 
Error detection is much easier than error correction. Since 
the code polynomial is a multiple of the generating polyno- 
mial p(x), the received code polynomial R(x) will be a code 
polynomial if and only if the remainder upon division of 
R(x) by p(x) is zero. 

There is only one condition for a valid codeword, that 
is the equality of all the syndromes to zero. This condition 
is the desired one we will be trying to satisfy throughout 
the entire decoding process. 

We will develop our decoding algorithm, considering the 
three possible conditions which are related with the errors 
occur ing in the encoded codeword. They are listed below: 

1. No error condition 

2. One error condition 

3. Two errors condition 

For the first condition, we have already shown that, all 
the syndromes are equal to zero. So, it is easy to determine 
the no error condition. The decoded (output) codeword will 
be the same as the received codeword. In case of one error, 
it is necessary to figure out the error position first. Vie 
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Figure 6. 1 Required Division for Example 6. 2 

will be using the equality we described for the first 
syndrome to decide the error position in our decoding 

algorithm. 
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f,.,Once the syndrome is calculated from the equation 



other equation described for calculating the first syndrome. 



error position, the necessary correction process is quite 
simple, as the actual error is equal to Sq, and it will be 
explained in the decoding algorithm. 

In case of two errors, the correction process is much 
more complex than the other two cases. Again, we must decide 
the error positions first. We use the equation described in 
step 10 of the decoding algorithm to do that. After finding 
out the error positions by trial and error method and the 
equation of step 10 , we use two different tables to deter- 
mine the error amounts occuring in the and l^''^ digits of 
the codeword, the determined error location points. The way 
to use these two tables will also be explained in the 
decoding algorithm. 

We follow a systematic procedure to describe the steps 
of the* algorithm so that, steps number two through four are 
related to the no error condition, steps number five through 
eight are related to the one error condition and steps 
number nine through fifteen handle the two error condition. 
Step number one, sixteen, seventeen and eighteen are the 
common steps and they are used each time the decoding algo- 
rithm is applied. Note: if it is determined that none of the 
three possibilities is operable then we say that more than 2 
errors have occured and a decoding failure is declared. 

1. Decoding Algorithm 

1. Compute the syndromes from the received word R; 




we can find out the error position using the 



S-j^ = R^e.^, where Rj^ = Sq and it is known. After deciding the 




2. If Sq = S^ = S 2 = S 3 = 0, then decide "NO ERROR". 



3. Set error vector E to all O's. 
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4. Go to step 16. 

5. If Sj^/Sq = 82 / 5 ^ = S 2 /S 2 then decide "OME ERROR". 

6 . Calculate k, the error position, using the equation; 



S^/Sq = k 

7. Set error vector E to; 







1 


I ° Ei 




Ek 




E = 








where i = 0 , 1 ,.. 






* 


So Si 

V. 


= 


Ek 


8 . 


Goto step 16. 










9. 


If not all of 


Si/Sq / S 2 /S 2 


/ 


S 3 /S 2 are equal 




then decide 


"TWO ERRORS". 






10 , 


Decide 1 and 


h. 


which correspond to the locations 



of errors occuring, using the following equation 



SiS3-( S2 )^=( Sq2^( + 2^"^+So2^( ) 

where k and 1 lie in the range of 0 to 9. 

11. Set i to 1 which corresponds to the index of 
of Table V. 

12. Pick the i^^ pair of e,d which corresponds 
to the error amount occuring using Table V. 

13. Try to satisfy the equation using the present 
values of e, d, k, 1 ; 

S]^ = e2^"^ + d2^“^ 

14. If the equation is not satisfied, then increment 
i by 1 and go to step 12 . 

If none of the values of i allow a solution 
then declare that more than 2 errors occured 
occured and a decoding failure results. 
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15. Set error vector E to ; 



E = 




_ Ei = Ej, 

Ej^ = Ei|_ where i = 0 , 1 , . . . , 10 
otherwise 



16. Output the codeword by subtracting the error 
vector from received vector. 



C - R - E - ( Rq-Eq^ . . , R]^q-Ej^q) 

17. Check syndromes again and verify that; 

^0 “ ^1 “ ^2 “ ^3 “ 

18. Then the information symbols are ; 

Cq, C-^, . . . ,Cq. 
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TABLE V 

POSSIBLE VALUES OF THE PARAMETERS 
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We illustrate the decoding algorithm we presented in 
this chapter by three examples. Each of these examples \ 7 ill 
correspond to one of the error conditions we discussed in 
the decoding algorithm. 

We use the received codeword R, as mentioned in 
example 6. 2 throughout the examples we will present in this 
section. In each case, the value of C is 0000001-312-1. 

We will first begin with the no error condition: 

Example 3 : Suppose we have received the codeword R = 
0000001-312-1. Referring to the decoding algorithm, first we 
calculate the syndromes in the following way: 



Received Codeword I 0000001-312-1 



Position #(k) | 0123456789 10 



Applying the decoding 
syndromes will be; 



algorithm step 1, the 



Sq=1-3+1+2-1 



= 0 . 



Si ===> 



-3 



2 ^ 2 ^ 2 ° 



8-122 ====> S]^ = 8- l + 2 + 2 = 0. 
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5 2 ===> 1-312 

2 ^ 2 ‘^ 2 ^ 2 ° 

9 -4 4 2 ====> 52 = 9- 4 + 4 + 2 = 0. 

5 3 ===> 1-312 

2 ^ 2 ^ 2 ^ 2 ° 

6-582 ====> 53=6-5+8+2=0. 

S i 3 T 1 C 0 ~ ^1 ~ ^2 ~ ^3 ~ ^ / ci 0 cici 0 ERROR.^ . 

S0t th0 0rror V0Ctor E to all Z0ros; 

E = 00000000000 

Applying st 0 p 16 of tho algorithm; 



R 



0 0 0 0 0 0 1 



-3 12-1 



E 



0 0 0 0 0 0 0 



0 0 0 0 



C = 0000001 -3 12 -1 



Check and verify the syndromes and output the veri- 
fied codeword as C = 0000001-312-1. The information is then 
determined as 0000001 . 
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Example 6. j4 : Suppose we have received the codeword R = 

0000011-312-1. Referring to the decoding algorithm-, first we 
will calculate the syndromes in the same way as in the 
previous example. 



I Received Codeword | 00000 1 1 -3 1 2 -1 



I Position #(k) | 0123456 -7 89 10 



Applying the decoding algorithm step 1, the 
syndromes will be; 

Sq=1+1-3 + 1 + 2-1 = 1. 

===> 11-312 
2^ 2^ 2^ 2 ^ 2 ® 

5 8 -1 2 2 ===> S 3 _ = 5 + 8-1 + 2 + 2 = 5. 

52 ===> 11-312 

28 2 ^ 2 ^ 2^ 2 *^ 

39-442 ===> 52=3+9-4+4+2=3. 

5 3 ===> 11-312 

2 ^ 2 ^ 2 ^ 2 ^ 2 ° 

4 6 -5 8 2 ===> 83 = 4 + 6 - 5 + 8 + 2 = 4 . 

The syndromes satisfy the condition; 

S^/Sq = S 2 /S]_ = S 3 /S 2 = 2^. Decide "ONE ERROR". 
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According to the decoding algorithm step 6; 

So = Ej, = 1 

S3_ = 5 = 2^ = ===> k = 5. 

So, decide the error position as position number 5 and the 
error as 1. 

Set the error vector E as described in step 7 of the 
algoritlim; 



E = OQOOOIOOOOO 

’Applying step 16 of the algorithm; 



R 



0 0 0 0 0 1 1 



-3 12-1 



- E 



0 0 0 0 0 1 0 



0 0 0 0 



V 



C = 0000001 -3 12 -1 



Check and verify the syndromes and output the veri- 
fied codeword as C = 0000001-312-1. Again the information is 
0000001. 
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Example 5 : Suppose we have received the codeword R = 
0001101-312-1. Referring to the decoding algorithm, we again 
calculate the syndromes in the usual way. 



Received Codeword | 0001101-312-1 



Position #(k) 1*0123456789 10 



Applying the decoding algorithm step 1, the 
syndromes will be; 

Sq=1+1+1-3+1+2-1=2 

===> 1101-312 
26 2^ 2^ 2^ 2^ 2^ 2® 
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Since the syndromes are not all zero and do not 
satisfy the condition; 

Sj^/Sq = S 2 /S-^ = S 3 /S 2 , decide "TWO ERRORS". 
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According to the decoding algorithm step 10; 

S^S3 - (82)^ = 2^ - (2®)^ = 2^-2^= 2^ 

We try to satisfy the equation; 

2’^=( Sq2 ^( ^~^)-S3_2^"^)( 532^"^- Sq 2 ^^ ) 

Substituting the values of Sq and S^/ and applying 
the trial and error method, the only pair of k and 1 would 
be calculated as; 

k = 3 , 1=4 

So, decide the error positions as position numbers 3 

and 4. 



According to step 12 of the algorithm, pick the 
first possible pair of the e and d using Table V and then 
try to satisfy the equation described in step 13; 

S;j^ = e2^"^ + d2^"^ 

2^ = 1*2^"^ + 1*2^"'^- 
2^ = 2^ + 2^ = 8 

2^ = 2^ 

So, it would take only one iteration to decide the 
errors, deciding e = d = 1. Set the error vector E as 
described in step 15 of the decoding algorithm; 

E = 00011000000. 

If e = d = 1 did not satisfy the equation, the next 
pair e = 3, d = A would be tried, etc. 
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Applying step 16 of the algorithm; 



R = 0001101 -3 12 -1 

E = 0001100 000 0 



C = 0 



0 



0 0 



0 



1 



-3 12-1 



Check and verify the syndromes and output the veri- 
fied codeword as C = 0000001-312-1. Again the information is 
given as 0000001. 

If more than 2 errors were made, then the above 
procedure would fail and a "decoding failure" would result. 
At this point a request for a retransmission would be initi- 
ated by the receiver. 
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VII. POSSIBLE INTERFACE FOR DATABASE APPLICATIONS 



A. BACKGROUND 

The size, power and number of database management infor- 
mation systems available and in use has grown dramatically 
in recent years. Business, industry and government seem to 
have been swept into the automation of data collections with 
a fervor akin to the automation of accounting systems during 
the late 1960's. As might be expected, people are once again 
discovering that the old adage of "garbage in, garbage out" 
continues to hold true. Although many of the errors in data- 
bases could probably be caught and corrected by appropriate 
error checking and correcting procedures, the cost for 
humans to perform such checking would be very high and the 
work very tedious. It makes sense to automate data storage 
and management, and remove the painstaking tasks for error 
checking and correcting from the human operator. 

Virtually all DBMS's (Database Management System) incor- 
porate in them some form of error checking facilities. 
[Ref. 7]. Typical checks are for proper data format 
(integer, real, alphabetic, etc. ), proper numeric sign, the 
correct number of data items and the presence or absence of 
data in certain fields. While these checks are important and 
help to prevent some errors, the number of situations in 
which major errors in databases have been found clearly 
indicates that these checks are not sufficient. The purpose 
of the error correction mechanism we described in the 
previous chapters is to provide a means of error checking 
which far exceeds the power of typical DBMS type of error 
checking. The error correction mechanism, we introduced, can 
be used both to perform error correction for pre-existing 
databases, which we term the checking of of a 'static' data- 
base, and to check proposed database updates before they are 
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passed on to the DBMS, which we term the checking of a 
' dynamic ' database. 

B. DATABASE ERRORS 

Database errors can be divided into four types: 
security, consistency, reliability and integrity. Security 
encompasses the control of all unauthorized access to the 
database. Both physical and logical means of access control 
are usually required. Consistency deals with the problems of 
errors which are introduced in the process of sharing data- 
bases. These can be 'due to either multiple users sharing a 
single database or multiple users sharing more than one copy 
of a database. A database can be inconsistent when multiple 
updates are processed out of sequence or the database 
changes during the course of a user providing an update. 
Under this definition, consistency checking involves only 
the specific data. Such checking would not invoke any of the 
"implied meanings" of the data, i. e the information in the 
semantic description of the database. Thus, a requirement 
such as an update of one item necessitating a corresponding 
update of another item is not a consistency problem [Ref. 8] 

Reliability refers to the problems of assuring that both 
the hardware and software components of the data management 
system perform as they were intended all of the time. 
Integrity errors include all types of errors which can be 
introduced due to active use of the database system. These 
may result from mundane sources such as typing or spelling 
errors, transmission errors which cause the data to be 
garbled or transformed between the original source and the 
database system, or user misunderstandings of the nature or 
content of the database. 

A computer has no built-in criteria which it can use to 
determine whether or not a given piece of data is correct in 
a given context. Thus, if a computer is to be used to 
detect and correct integrity errors, the computer must be 
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provided with such criteria. The error correction mechanism 
we described addresses only the detection and correction of 
primarily the last of these error types, integrity errors, 
although some consistency errors can be detected. 

C. QUERY INTERPRETER 

Before starting to explain the role of the query inter- 
preter which is the existing interface between the users and 
DBMS, it is necessary to identify the types of queries which 
could be given to the Supply System database. 

Depending upon th-e NATO countries in which this kind of 
interface exists, the number of user queries which can be 
given to the system might vary. In general, there is a range 
of twenty to thirty queries which can be found in a typical 
Supply System including queries for search, update, delete 
etc. However, one common part of all these queries is the 
NSN (National or Nato Stock Number). In other words, no 
matter what the query is, there should be a stock number in 
it, since the stock number is the only kev of the database. 
Thus, it would be a pretty good idea to check and then 
verify the NSN before it is passed to DBMS for processing 
according to the given query. This kind of checking can be 
obtained via a detection and correction routine which would 
be added to the query interpreter as a component. 

As shown in Figure 7.1, the checking component of the 
query interpreter acts as a filter between users of the DBMS 
and DBMS itself. This checking component is a 'passive' 
filter in the sense that it is not visible to the DBMS users 
until a potential error is detected and corrected. However, 
whenever an error is detected and corrected, the system 
informs the user about the correction has been made, or if 
the number of errors are more than two it declares a 
decoding failure and requests a retransmission or another 
query. 
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I 

I 



r 1 




Possible Interface Supported 
with a Correction Routine 



Figure 7. 1 Possible interface between user and DBMS 

Generally, writing detection routines is not difficult. 
If the structure is sufficiently well understood for update 
and access routines to be written, then the detection 
routine can likely also be written with about the same 
effort. The implementation of a correction routine is more 
difficult than implementation of a detection routine. But, 
for this particular application, it becomes relatively easy 
using the decoding algorithm we presented in Chapter VI. 

Recalling the construction of a NSN from Chapter I, 
there are thirteen digits in a NSN. After the encoding 
process, there are added four more digits as check digits 
which makes the length of the codeword seventeen digits. 
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Wheri this encoded codeword is received with or without error 
by the query interpreter, it is the correction routine's 
responsibility to check and verify it using the check digits 
and the decoding algorithm described in Chapter VI. In this 
way, terminal operator's mistakes as well as the other type 
of mistakes discussed as integrity errors can be detected 
and corrected. After verifying the received codeword 
(encoded NSN), the check digits are removed and the output 
of the correction routine gives the original thirteen digit 
NSN. Then this NSN and the interpreted query are passed to 
the DBMS for processing. Using this kind of interface and 
error checking and correcting mechanism together does not 
require any change in the construction of the physical data- 
base, and it provides a more efficient system in terms of 
reliability, integrity and time. 
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VIII. CONCLUSION 



In this thesis we have taken a modular approach to the 
implementation of Reed-Solomon code in order to provide an 
error correction mechanism for the existing National or NATO 
Supply systems. By initially discussing algebraic coding 
theory and finite field theory, we have shown that they play 
an integral part in the overall implementation. The imple- 
mentation theory is represented first because of its neces- 
sity to understand the implementation more easily. It is 
then followed by the design of the encoding and decoding 
algorithms which provide two error correction for the 
National or NATO Stock Number (MSN). 

After defining the approach and the associated algo- 
rithms, it is then followed by a possible database inter- 
face. Thus the user of a database system can be supported by 
this kind of interface and database system itself becomes 
more reliable and efficient. The most common problems and 
general types of errors we have presented in the previous 
chapter showed that, when human operators get involved with 
the operation of these kind of systems, numerous types of 
errors should be expected. Encoding and decoding algorithms 
we presented in this thesis are developed based on the facts 
that an algorithm should be satisfied in order to detect and 
correct possible types of human operator errors without 
costing more in terms of money and personal effort. Because 
we can correct up to two errors, the reliability and integ- 
rity have improved. 

It is hoped that, with this thesis as a guide, some 
interested supply officers or other officials will make the 
necessary changes in the Supply System Database in order for 
increasing its reliability and efficiency. 
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